An Aggregate Analysis of Pronunciation in the Goeman-Taeldeman-Van Reenen-Project Data
نویسندگان
چکیده
Contemporary Dutch dialects are compared using the Levenshtein distance, a measure of pronunciation difference. The material consists of data from the most recent Dutch dialect source available: the Goeman-Taeldeman-Van Reenen-Project (GTRP). This data consists of transcriptions of 1876 items for 613 localities in the Netherlands and Belgium gathered during the period 1980 – 1995. In addition to presenting the analysis of the GTRP, we compare the dialectal situation it represents to the Reeks Nederlands(ch)e Dialectatlassen (RND), in particular to the 350-locality sample studied by Heeringa (2004), noting areas of convergence and divergence. Although it was not the purpose of the present study to criticize the GTRP, we nonetheless note that transcriptions from Belgian localities differ substantially from the transcriptions of localities in the Netherlands, impeding the comparison between the varieties of the two different countries. We therefore analyze the developments in the two countries separately.
منابع مشابه
Dialect Pronunciation Comparison and Spoken Word Recognition
Two adaptations of the regular Levenshtein distance algorithm are proposed based on psycholinguistic work on spoken word recognition. The first adaptation is inspired by the Cohort model which assumes that the word-initial part is more important for word recognition than the word-final part. The second adaptation is based on the notion that stressed syllables contain more information and are mo...
متن کاملPhonological and Phonetic Databases at the Meertens Institute
The Meertens Institute in Amsterdam was founded in 1930 under the name ‘Dialect Bureau’ (Dialectenbureau), and became an official institute of the Royal Netherlands Academy of Arts and Sciences (KNAW) in 1952. In 1979, it was named after its first director, P.J. Meertens (1899-1985), a student of 17th century Dutch literature. Currently it comprises two departments, one of Dutch Ethnology and o...
متن کاملInducing Sound Segment Differences Using Pair Hidden Markov Models
Pair Hidden Markov Models (PairHMMs) are trained to align the pronunciation transcriptions of a large contemporary collection of Dutch dialect material, the GoemanTaeldeman-Van Reenen-Project (GTRP, collected 1980–1995). We focus on the question of how to incorporate information about sound segment distances to improve sequence distance measures for use in dialect comparison. PairHMMs induce se...
متن کاملA MULTI-OBJECTIVE OPTIMIZATION MODEL FOR PROJECT PORTFOLIO SELECTION CONSIDERING AGGREGATE COMPLEXITY: A CASE STUDY
Existing project selection models do not consider the complexity of projects as a selection criterion, while their complexity may prolong the project duration and even result in its failure. In addition, existing models cannot formulate the aggregate complexity of the selected projects. The aggregated complexity is not always equal to summation of complexity of projects because of possible syne...
متن کاملTotal and Partial efficiency indexes in data envelopment analysis
Introduction: Data envelopment analysis (DEA) is a data-oriented method for measuring and benchmarking the relative efficiency of peer decision making units (DMUs) with multiple inputs and multiple outputs. DEA was initiated in 1978 when Charnes, Cooper and Rhodes (CCR) demonstrated how to change a fractional linear measure of efficiency into a linear programming format. This non-parametric app...
متن کامل